Previs: a person-specific realistic virtual speaker
Authors
Abstract
This paper describes a realistic 2D talking face. The facial appearance is represented by a parameterised, sample-based 2D model that supports moderate head movements, facial gestures and emotional expressions. Two main contributions to talking-head applications are proposed. First, the image of the lips is synthesized from shape and texture information. Second, a nearly automated training process, based on mouth tracking, makes personalizing the talking face easier. Additionally, the lips are synchronized in real time with speech generated by a SAPI-compliant text-to-speech engine.
Similar resources
A Communication Task in HMD Virtual Environments: Speaker and Listener Movement Improves Communication
In this paper we present an experiment that investigates the influence of animated real-time self-avatars in immersive virtual environments on a communication task. We further investigate the influence of 1st- and 3rd-person perspectives and of tracking the speaker and the listener. We find that people perform best in our communication task when both the speaker and the listener have an an...
Virtual People: Capturing Human Models to Populate Virtual Worlds
In this paper a new technique is introduced for automatically building recognisable moving 3D models of individual people. Realistic modelling of people is essential for advanced multimedia, augmented reality and immersive virtual reality. Current systems for whole-body model capture are based on active 3D sensing to measure the shape of the body surface. Such systems are prohibitively expensiv...
Algorithms for Audiovisual Speaker Localisation in Reverberant Acoustic Environments
Innovative and future human-machine interfaces and video conference systems require knowledge of the speaker’s position for automatic beamformer and camera steering. To determine this position, acoustic as well as visual localisation techniques can be applied, and the aim of this project was to develop suitable algorithms for such audiovisual speaker localisation. Furthermore, an ex...
The MultiLis Corpus - Dealing with Individual Differences in Nonverbal Listening Behavior
Computational models that attempt to predict when a virtual human should backchannel are often based on the analysis of recordings of face-to-face conversations between humans. Building a model based on a corpus brings with it the problem that people differ in the way they behave. The data provides examples of responses of a single person in a particular context but in the same context another ...
Study of Applicability of Virtual Users in Evaluating Multimodal Biometrics
A new approach to enlarging fused biometric databases is presented. Fusion strategies based on matching scores are applied to active biometric verification scenarios. Consistent biometric data of two traits are used in test scenarios of handwriting and speaker verification. The fusion strategies are applied to multimodal biometrics of two different user types. The real users represent two bio...